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ABSTRACT: 

With the COVID-19 pandemic, the wearing of face masks covering mouth and nose has become ubiquitous all 
around the world. This study investigates the impact of typical face masks on voice radiation. To analyze the trans- 
mission loss caused by masks and the influence of masks on directivity, this study measured the full-spherical voice 
directivity of a dummy head with a mouth simulator covered with six masks of different types, i.e., medical masks, 
filtering facepiece respirator masks, and cloth face coverings. The results show a significant frequency-dependent 
transmission loss, which varies depending on the mask, especially above 2 kHz. Furthermore, the two facepiece res- 
pirator masks also significantly affect speech directivity, as determined by the directivity index (DI). Compared to 
the measurements without a mask, the DI deviates by up to 7 dB at frequencies above 3 kHz. For all other masks, the 


deviations are below 2 dB in all third-octave frequency bands. © 2020 Acoustical Society of America. 


https://doi.org/10.1121/10.0002853 


(Received 14 August 2020; revised 30 October 2020; accepted 11 November 2020; published online 14 December 2020) 


[Editor: James F. Lynch] 


Il. INTRODUCTION 


In times of crisis, such as the COVID-19 pandemic, 
security measures affect human face-to-face communica- 
tion in many different ways. Speech intelligibility is 
impaired by greater physical distances between interlocu- 
tors, resulting in increased masking by background noise 
and a lower direct-to-reverberant ratio (DRR) at the listener 
position. In addition, face masks covering the mouth and 
nose cause transmission loss, reducing the energy radiated 
by the speaker. Depending on the frequency characteristics 
of the transmission loss, this can affect speech intelligibility 
(Palmiero et al., 2016; Radonovich et al., 2010). As the 
transmission loss of the masks may also vary with respect 
to the direction of radiation, the DRR could be further 
affected if, for example, the masks attenuate frontal sound 
radiation more than lateral radiation. Furthermore, other 
effects (e.g., lip reading) also strongly affect speech intelli- 
gibility (Matthews et al., 2002; Sumby and Pollack, 1954; 
Summerfield, 1992). 

So far, the acoustic effects of face masks have only 
been investigated in a small number of studies. Radonovich 
et al. (2010) and Palmiero et al. (2016) evaluated the influ- 
ence of respirator masks worn by healthcare workers on 
speech intelligibility. In these studies, the influence of masks 
on the speech transmission index (STI) was determined for 
certain room settings. Recently, Goldin et al. (2020) 
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analyzed three medical masks and found a low-pass charac- 
teristic attenuating frequencies above 2 kHz. 

To further investigate to what extent the acoustic effects 
of face masks affect voice radiation and consequently 
speech intelligibility, we present full-spherical voice direc- 
tivity measurements of a dummy head with a mouth simula- 
tor covered with six masks of different types, i.e., medical 
masks, filtering facepiece respirator masks, and cloth face 
coverings. We analyze the transmission loss caused by the 
masks as well as their influence on directivity. Although it is 
already clear that the transmission loss caused by the masks 
can reduce speech intelligibility, it has not yet been investi- 
gated how face masks affect speech directivity and, there- 
fore, possibly also speech intelligibility. Resonances of 
vibrating structures of the mask or cases where the mask is 
not completely closed at the sides may lead to reduced fron- 
tal sound radiation compared to the lateral or rear radiation, 
impairing speech directivity. Accordingly, when a speaker 
inside a room faces a listener, the reverberant energy caused 
by sound radiation in all directions is decreased less than the 
direct sound energy, resulting in a reduced DRR at the lis- 
tener position and possibly reduced speech intelligibility. 
Furthermore, the influence on the directivity could become 
relevant if the interlocutors do not look at each other directly 
but turn their heads slightly and thus radiate speech laterally, 
as is often observed in conversations. 


ll. MEASUREMENT PROCEDURE 


The directivity measurements were done with a HEAD 
acoustics HMS II.3 dummy head and mouth simulator (head 
width 14.0 cm; head height 22.5cm; head length 20.0 cm). 
However, as a mouth simulator cannot reflect phoneme- 
dependent effects, there are distinct differences between the 
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radiation of a dummy head and the human voice. For exam- 
ple, the mouth opening’s variable size plays an important 
role (Brandner et al., 2020), as well as sound radiation of 
the nasal passage and the position of the voice excitation in 
the vocal tract. These phoneme-dependent effects are one 
important aspect of the dynamic voice directivity which has 
been analyzed in several studies, e.g., in Kocon and Monson 
(2018), Katz and D’Alessandro (2007), Monson et al. 
(2012), and Porschmann and Arend (2020). While simula- 
tions could probably examine some aspects of voice radia- 
tion such as the mouth opening’s variable size, other aspects 
such as the physical contact between the mask and the 
vibrating lips are much harder to analyze. Despite these dif- 
ferences, a dummy head’s radiation generally covers typical 
spatial characteristics of human voice radiation (Halkosaari 
et al., 2005) and therefore provides a good approximation. 
Accordingly, we assume that the influence of the masks on 
voice radiation is comparable for a dummy head and human 
voice radiation. 

The measurements were performed in the anechoic 
chamber of TH Koln, sized 4.5m x 11.7mx2.30m 
(W x DxH), with a lower boundary frequency of about 
200 Hz. As shown in Fig. 1, the HEAD acoustics HMS II.3 
dummy head and mouth simulator was mounted on the 





FIG. 1. (Color online) Measurement setup with the HEAD acoustics HMS 
II.3 dummy head and mouth simulator mounted on the VariSphear measure- 
ment system (Bernschutz et al., 2010) and the omnidirectional Microtech 
Gefell M296S measurement microphone at a distance of 2m from the 
dummy head. 
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VariSphear measurement system (Bernschutz et al., 2010), 
which rotated the dummy head along a virtual sphere to the 
respective direction. In each direction, the excitation signal 
was played back over the mouth simulator and captured at a 
fixed position at a distance of 2m from the center of the 
dummy head with a Microtech Gefell M296S omnidirec- 
tional microphone. The loudspeaker (mouth simulator) of the 
HEAD acoustics HMS II.3 was driven by an Apart MB-150 
amplifier. An RME Babyface was used as AD/DA converter 
and microphone amplifier. The excitation signal was an 
emphasized sine sweep (2'8 samples at a sampling rate of 
48 kHz, length of 5.5 s). Impulse responses were measured 
for 2702 directions on a 44th order Lebedev sampling grid 
(Lebedev, 1977). In post-processing, adaptive low-frequency 
extension for frequencies below 200 Hz was applied to the 
impulse responses (Xie, 2009). Subsequently, the frequency 
and phase response of the loudspeaker (mouth simulator) 
was compensated by inverse finite impulse response (FIR) 
filtering with a frontal impulse response measured without a 
mask at azimuth ġ = 0° and elevation 0 = 0°, corresponding 
to a free-field equalization of the measurements. Finally, the 
impulse responses were truncated and windowed to a length 
of 128 samples at a sampling rate of 48 kHz. 

Except for the reference directivity measurement of the 
mouth simulator without a mask, the measurement procedure 
was repeated twice for each of the following face masks: 


(1) Disposable medical mask: Triple Layer Filter, dust- 
proof, non woven earloop, (manufactured in China, 
brandname: Arvin, Protection Class DS3). 

(2) Three-dimensional respirator mask, Protection Class 
KN 95 (manufacturer: Suzhou Jinruida Protective 
Equipment Co, Inc.). 

(3) Fine dust respirator mask, MB 21, Protection Class FFP 
2 (manufacturer: MB Filter Products AB, Savedalen, 
Sweden). 

(4) Microfibre scarf (manufacturer: Rose, material: 100% 
polyester). 

(5) Cloth face covering: Single layer cotton (manufacturer: 
Modeatelier Scharn, Engelskirchen, Germany). 

(6) Cloth face covering: Hand-made with two layers of cot- 
ton (manufacturer: Stoffliebe, Gelsenkirchen, Germany). 


According to the World Health Organization (2020), 
these masks (or more generally mouth and nose covers) can 
be categorized as medical masks (1), filtering facepiece 
respirator masks (2, 3), or cloth face coverings (4, 5, 6). 
Figure 2 shows the dummy head with the six tested masks. 


lll. RESULTS 


Every mask was measured twice. In between, it was 
taken off and put on again. We calculated the signed differ- 
ence between both measurements in dB averaged over all 
directions for frequencies below 8 kHz to analyze any possi- 
ble deviations. The deviations were only about 0.5 to 
0.8 dB, so we did not further investigate the effects of put- 
ting on and taking off the mask and simply refer to the first 
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FIG. 2. (Color online) HEAD acoustics HMS II.3 dummy head and mouth simulator with all the masks tested. 


measurements in the following. However, the publicly avail- 
able dataset contains all measurements.’ 


A. Transmission loss 


In the remainder, the directivity is denoted as p(Q, œ) 
with respect to the direction Q (¢, 0) and the angular frequency 
œ = 2nf, where fis the temporal frequency. ¢ denotes the azi- 
muth angle ranging from —180° to +180°, and 0 the elevation 
ranging from —90° to 90°, where 90° is at the top, and —90° 
at the bottom. The transmission loss T(Q, œ) between the ref- 
erence directivity Pref (Q, œ) without a mask, and the directiv- 
ity Pmask(Q, œ) with a mask, can be expressed as 


[Pret (Q, @) j 
[Pmask (Q, )| 





T(Q, œ) = 10lg (1) 


7: 


Figure 3 (left) shows the results for frontal sound radia- 
tion Qo =(¢ = 0°, 0 = 0°). Up to 2kHz, the transmission 
loss is relatively low and flat for most masks but increases 
rapidly for masks 2 and 3 above 2 kHz with peaks of about 
15 dB between 3 kHz and 5 kHz. Mask 6 shows a completely 
different frequency-dependent shape of the transmission 
loss. It exhibits a similar increase above 2 kHz, but already 
relevant transmission loss at low frequencies and a first 
strong peak at about 900 Hz. 
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(5) 


For further analysis, we calculated Tayg(@) by averag- 
ing the transmission loss over all radiation directions, Q, 


1 e Pret (Q4, w)|? 
Ji 
Q q=1 |[Pmask (Q4, «)| 





Tave(@) = 10lg (2) 


Figure 3 (right) shows the results for the average transmission 
loss. In general, a similar trend can be observed as for the fron- 
tal sound radiation. However, for the medical respirator masks 
(masks 2 and 3), the peaks vary slightly with the radiation 
direction, and the shape of the curves becomes smoother and 
has less prominent peaks due to the averaging over direction. 
In contrast, for the other masks, the differences between 
Fig. 3, left and right, are much smaller, indicating that the 
peaks are rather independent of direction. Masks 1, 4, 5, and 6 
show two peaks at about 900 Hz and 4kHz, whereas for mask 
4 the peaks are below 3 dB, and for masks 1 and 5, they are 
below 6dB. Mask 6 exhibits peaks of more than 10dB. 
Furthermore, for mask 6, the transmission loss is already above 
3 dB at low frequencies. This may be due to the structure of 
this mask consisting of two layers of thick cloth. 


B. Directivity analysis 


To analyze the directivity in the horizontal and vertical 
plane, the 2702 measured impulse responses were 
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FIG. 3. (Color online) Transmission loss T(Qo,@), 1/12 octave smoothed, of the different masks for frontal sound radiation Qo = (ġ = 0°, 0 = 0°) (left), 


and averaged over all directions Tavg(%) (right). 


transformed to the spherical harmonics (SH) domain 
(Williams, 1999) at an SH order N = 35 and resampled to 360 
directions in 1° steps along the horizontal plane [ġ = —180° 
to 180° (where positive angles pointing to the left), 0 = 0°], 
and along the vertical plane [ġ = 0°, 0 = —180° to 180° 
(where 0° points to the front, 90° to the top, and 180° to the 
back)] using the inverse SH transform. Figures 4 and 5 show 


Front Front 






































Back Back 


4 kHz 5 kHz 





polar plots of the horizontal and vertical directivities in third- 
octave bands. We only present the directivity patterns for the 
frequency bands above 500 Hz, as we could not observe any 
relevant influence of the masks on directivity for lower fre- 
quencies. For the third-octave bands up to 1.25 kHz, the direc- 
tivities vary between the masks less than 1 dB, and thus, barley 
differ from the reference (black curve). Only for the directivity 
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FIG. 4. (Color online) Directivity patterns in the horizontal plane (p = —180° to 180°, 0 = 0°) for all masks in third-octave bands. 
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FIG. 5. (Color online) Directivity patterns in the vertical plane (¢ = 0°, 0 = —180° to 180°) for all masks in third-octave bands. 


of mask 6 can some differences be observed, e.g., of 1 dB in 
the 630 Hz band at 60° in the vertical plane (Fig. 5). Above 
1.25 kHz, the differences rise in magnitude with increasing 
frequency. As with the transmission loss, the filtering face- 
piece respirator masks (masks 2 and 3) exhibit the most 
prominent differences to the reference. In the 3.15 kHz 
band, which covers the same frequency range for which we 
observed the maximum transmission loss for the two masks, 
significant directivity variations occur. Furthermore, the 
directivities of masks 2 and 3 vary strongly for adjacent fre- 


Appendix for a further illustration of the influence of the 
masks on sound radiation in the horizontal and vertical 
plane with respect to frequency. 


C. Directivity index (Dl) 


For a more detailed analysis of the directivity, we deter- 
mined the DI for spherical sound radiation, which can be 
calculated as 


4r|p(Qm,o)|? 











quency bands. For example, the directivity at 4kHz is much DI = 101g 77) ; (3) 
broader for mask 3 than for the other masks, while it is |p(Q, a) |? cos dôd 

directed stronger to the front at 5kHz. Please refer to the =n J —n/2 

TABLE I. DI in the third-octave bands related to the frontal direction Qn. 

Mask type 630 Hz 800 Hz 1kHz 1.25 kHz 1.6kHz 2kHz 2.5kHz 3.15 kHz 4kHz 5 kHz 6.30 kHz 8 kHz 
No mask 3.22 1.11 0.42 1.86 4.99 6.60 3.72 3.54 4.97 4.00 4.47 5.69 

Mask 1 3.43 1.18 -1.35 1.54 4.76 6.78 3.96 2.53 4.57 2.38 3.20 4.82 

Mask 2 3.52 1.40 0.44 1.18 4.55 6.50 4.21 0.42 0.93 4.12 4.93 7.21 

Mask 3 3.23 1.26 0.48 2.09 4.75 6.14 4.88 2.55 2.29 7.86 9.65 6.47 

Mask 4 3.38 0.93 0.36 2.20 5.04 6.60 3.47 3.85 5.05 4.41 4.63 6.11 

Mask 5 3.36 1.07 -1.22 1.58 4.69 6.57 3:65 2.79 4.43 2.80 3.33 522 

Mask 6 3:55) 1.46 -1.10 1.84 4.66 6.52 3.99 3.63 4.95 2.53 3.39 5.79 
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FIG. 6. (Color online) DI, 1/12 octave smoothed, of the different masks. 


where Qm denotes the main direction of sound radiation. 
Usually, the direction with maximum magnitude of radia- 
tion of the human mouth is inclined slightly downwards 
(Marshall and Meyer, 1985), which can also be seen in 
Fig. 5. However, we compensated the measurements for 
the frontal direction (Qo), and furthermore, the frontal 
direction is included in all directivity plots. Therefore, we 
decided to calculate the DIs related to the main direction 
Omn = Qo. Table I summarizes the DIs for the third-octave 
bands between 630 Hz and 8 kHz. Apart from masks 2 and 
3, the differences of the DI to the reference remain below 
1 dB for frequencies up to 4kHz and below 2 dB for fre- 
quencies up to 8kHz. Again, the two facepiece respirator 
masks, masks 2 and 3, have the largest influence. In par- 
ticular, between 3.15 and 6.3 kHz, the DI varies signifi- 
cantly. The differences are most prominent in the 4kHz 
band, exceeding 4dB (mask 2) and 7 dB (mask 3). This is 









supported by Fig. 6, which shows the DI with respect to 
frequency. In the frequency range from 3 to 5 kHz, the DI 
is significantly decreased by masks 2 and 3. At about 
5 kHz the DI of mask 3 sharply rises towards high fre- 
quencies and is about 5 dB higher than the DI of all other 
measured datasets. The deviations of the DIs for the two 
facepiece respirator masks (masks 2 and 3) match the 
observations from Sec. HIB, where we found the largest 
directivity variations for the same masks at the same fre- 
quencies (Figs. 4 and 5), and also the findings from Sec. 
III A, where we found the most considerable transmission 
loss for the corresponding masks in similar frequency 
regions. 


IV. CONCLUSION 


This paper presented measurements and analyses of 
sound radiation from a dummy head and mouth simulator 
wearing various face masks, i.e., medical masks, filtering 
facepiece respirator masks, and cloth face coverings. As dis- 
cussed in Sec. II, a dummy head reproduces the typical spa- 
tial structure of human voice radiation, even though several 
aspects of human voice radiation like, e.g., the dynamic 
directivity, cannot be considered. 

All examined masks result in a noticeable transmission 
loss at frequencies above 2 kHz. Thus, the transmission loss 
of the masks affects relevant frequency components of 
speech transmission, and therefore certainly impairs speech 
intelligibility. While in the frontal direction for the facepiece 
respirator masks (masks 2 and 3) the transmission loss 
increases strongly above 3 kHz by up to 15 dB, for the medi- 
cal mask and most of the cloth face coverings, the transmis- 
sion loss remains below 6dB for frequencies above 3 kHz. 
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FIG. 8. (Color online) Transmission loss T(œ) in the vertical plane with respect to frequencies from 400 Hz to 10 kHz. 


An exception is a hand-made cloth face covering (mask 6), 
which consists of two thick layers of cotton, showing a 
much higher transmission loss already at low frequencies. 
Our results are in line with the results of Goldin et al. 
(2020). The study analyzed three different masks and deter- 
mined an attenuation from 4 to 6dB for a simple medical 
mask and an attenuation of 12 dB up to 18 dB for the higher 
protective N95 masks in the frequency range between 3 and 
7kHz. 

The analysis of the directivity showed that the masks 
affect speech directivity differently. Both tested facepiece 
respirator masks lead to an increase of the DI by up to 7dB 
between 3 and 5 kHz, and one of them to an increased DI for 
5 dB above 5 kHz. The other masks show only a weak influ- 
ence on the directivity, affecting the DI by a maximum of 
1 dB for frequencies up to 4kHz and 2 dB for frequencies up 
to 8kHz. The DI can be directly related to the DRR, for 
which Larsen et al. (2008) determined just-noticeable dif- 
ferences (JNDs) of about 2—3 dB in rooms with a DRR of 0 
or +10dB and JNDs of about 6-8 dB in rooms with a DRR 
of —10 or +20 dB. For some frequency bands, the change of 
the DI caused by the facepiece respirator masks is in the 
range of the JND. Therefore, the facepiece respirator masks 
might impair speech intelligibility in rooms. However, as 
the DRR is a broadband measure, it is hard to predict how a 
decrease of the DI in specific frequency bands affects speech 
intelligibility. It can be assumed that especially phonemes 
incorporating significant energy in the affected frequency 
range might show degraded intelligibility. The datasets from 
this study can be applied in follow-up studies similar to 
Fogerty et al. (2020) or Kokabi et al. (2019) using room 
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acoustic simulations or virtual acoustics to analyze in more 
detail which way the masks affect speech intelligibility. 
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APPENDIX 


Figures 7 and 8 show the transmission loss with respect 
to frequency in the horizontal and vertical plane for a fre- 
quency range of 400 Hz to 10 kHz. The figures illustrate the 
directional dependency of the transmission loss and support 
the observations from Secs. HIA and IIC. In particular, 
masks 2, 3, and 6 exhibit notable transmission loss, as indi- 
cated by the deep blue color. 


‘AI directivity datasets are available in SOFA format under a Creative 
Commons CC BY-SA 4.0 license and can be downloaded at https:// 
doi.org/10.528 1/zenodo.3952320. The datasets will be published with a 
DOI on https://zenodo.org after acceptance of the paper. 
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